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We mathematize El Farol bar problem and transform it into a workable model. In general, the 
average convergence to optimality at the collective level is trivial and does not even require any 
intelligence on the side of agents. Secondly, specializing to a particular ensemble of continuous 
strategies yields a model similar to the Minority Game. Statistical physics of disordered systems 
allows us to derive a complete understanding of the complex behavior of this model, on the basis of 
its phase diagram. 
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INTRODUCTION 

Statistical mechanics has developed powerful tools to 
tackle analytically disordered systems with many degrees 
of freedom. These tools were recently shown to be appli- 
cable to systems of inductive heterogeneous agents such 
as the Minority Game 0, 0, 0, 0] - Even if the latter is 
inspired by El Farol's bar problem [f|, the literature on 
these two models are rather separate. In particular what 
the MG has brought to the understanding of the El Farol 
problem as defined by Arthur is not clear. Here we show 
that all results known about the MG directly apply to 
the El Farol problem. 

In the El Farol's bar problem, N — 100 customers 
have to decide independently whether to go or not to the 
bar, which has a capacity of L = 60 seats, the resource 
level. Attending when the bar is crowded is not enjoy- 
able. Customers are inductive rational agents. They use 
simple predictor rules, based on the past attendances, 
to predict whether the bar will be crowded or not, and 
behave accordingly. 

One important issue is whether the customers, that do 
not communicate with each other, are able to synchronize 
their actions so that the attendance A is on average equal 
to the resource level L. The main result of Arthur is that 
agents need not be endowed by a sophisticated deduc- 
tive rationality in order to synchronize. Even inductively 
rational agents can do. This results is probably respon- 
sible for the large interest his model aroused |6j. Here 
we show that even inductive rationality is not necessary 
because even zero-intelligence agents, acting as simple 
automaton, are able to self-organize to the comfort level. 
The convergence of the average attendance to the com- 
fort level L is trivial under very generic and reasonable 
conditions. The really non-trivial question is whether 
agents are able or not to reduce stochastic fluctuations 



of the attendance A around L. The Minority Game was 
introduced exactly to address this question, though 
in a simplified model. In what follows, by focusing on 
a particular ensemble of strategies for the El Farol bar 
customers, we derive a model for which we can use all 
the machinery used in the theory of Minority Games to 
derive a complete picture of the El Farol bar problem. 

The main results are that there is an optimal com- 
plexity of the strategies which agents should consider, 
depending on their number N. More precisely, for a 
fixed ratio I = L/N of seats to agents, coordination 
is optimal when agents use predictor strategies based 
on vfi ~ ol c log 2 N values of the past attendance. If 
to <C a c log 2 ./V crowd effects occur whereas when m 3> 
a c log 2 N the information on past attendance is way too 
redundant. 

We also show the importance for the agents to use con- 
sistent predictors. On the one hand, a small systematic 
bias affects considerably the results. On the other, an El 
Farol problem with consistent strategics is equivalent to 
a Minority Game. This implies that a large part of liter- 
ature on the latter model is also directly relevant to the 
study of the former without any modification. Inconsis- 
tent strategies correspond to biased strategies, or equiv- 
alently to a tunable resource level, in Minority Games. 
Such issues were investigated numerically in Refs. 0,0113 
with numerical simulations. 

A MATHEMATICAL FORMALISM FOR EL 
FAROL BAR PROBLEM 

In Arthur's paper, each customer uses the public 
knowledge of past m weeks' attendance 

l t = {A(t-m),A(t-m - !),••• ,A(t- 1)} 
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in order to determine whether to go the the bar, or to 
stay at home. She crafts a prediction of the next atten- 
dance A(t). If her prediction is larger than the resource 
level, she stays at home, else she goes to the bar. The 
learning procedure is inductive: she has a personal set 
of S fixed strategics based on S different attendance pre- 
dictors Ai t i, ■ ■ ■ ,Ai,s- These are functions mapping the 
information 2 about the past m attendances to the in- 
teger prediction Ai S (T) £ [0, N] of the next attendance 
A(t). 

Each attendance predictor Ai lS s — I,-- - ,S should 
not be rewarded depending on their precision but rather 
on the payoff they give to an agent who follows their 
advice. So if A(t) = 59, a prediction of 5 is better that a 
prediction of 61. More precisely, every predictor s has a 
score Ui :S associated to it, that evolves according to 

Ui,,(t + l)=U hS (t) + e{[A, s (2t) - L][A(t) — L]}, 

where Q(x) is the Heaviside function [0(x) = for x < 
and Q(x) = 1 for x > 0], 

N 

A(t) =^ai(t), 
»=i 

is the attendance at time t and aiit) is the choice of 
customer i at time t, which is determined by following 
her best predictor at that time. Mathematically, 

Si(t) = argmaxZ^. s /(t) 

s' 

and 

a i {t) = Q[L-A iMt) {l t )]- 

This is the setup of the game proposed by Arthur [l9j . 
He did not specify what the predictor space was, but gave 
only a few examples of predictors, such as the average of 
the last attendance numbers A(T t ) — 1/m Y^v=i 4(i ~~ 
t'), or a mirror number of the attendance of t — 3, i.e., 
A(T t ) = L — A(t — 3). Fogel et al 0] used auto-regressive 
functions A(It) — Y^k=i fkA(t — k), where ft are real 
numbers. 

Specifying the ensemble of predictors from which 
agents draw strategies is a key issue in the El Farol bar 
problem. 

Predictors and strategies 

In general, a predictor is a function from 2 £ [0, N] m 
to A £ [0,N]. There are (N + l)( Ar + 1 )'" different such 
functions. A strategy instead, is a function ail) from the 
set of possible informations 2 £ [0, N] m to an action a £ 
{0, 1}. Each strategy can be be considered as the result 
of the prescription of a predictor: a(T t ) = Q[L — A(T t )]. 



However, simple counting shows that there are 
2( Ar + 1 ) possible strategies, which is way smaller than 
the number of predictors for large TV. Hence many dif- 
ferent predictors A{I) correspond to the same strategy 
a(X t ). For a particular strategy a(I) with J2i a (-E) = 
there are 

Af(a) = (N-LfL^ +1 ^- K (1) 

predictors A which are consistent with that strategy. 

Eq. implies that not all strategies are equivalent, in 
principle. In order to illustrate this point, let us consider 
the case of a strategy resulting from a predictor taken 
at random. When N,L 3> 1, we almost surely pick a 
strategy which prescribes to go a fraction L/(N + 1) of 
times. Indeed by Eq. (|TJ, almost all strategies have this 
property. 

As a byproduct we see that Arthur's result that the 
attendance self-organizes to the comfort level L is trivial 
if agents draw predictors uniformly and at random from 
the whole predictor space. The attendance self-organizes 
to the comfort level L for the simple reason that agents 
following predictor strategies will attend with a proba- 
bility L/(N+l). 

Actually it is desirable to restrict the ensemble of 
predictors from which agents draw, to those having 
some consistency and continuity properties. Consistency 
means that the predictor should be consistent with past 
observations: for example if A(t — k) fluctuates around 
some value L a consistent predictor would also have A ~ 
L. A predictor of the Fogel type A(T) — J2kLi fkA(t—k) 
should be such that 2fe=i fk = lj else it would predicts 
systematically an attendance which is larger than the 
true one, hence, not be consistent. 

A minimal requirement of consistency is that the re- 
sulting strategies be unbiased, i.e. that "on average" 
they prescribe to attend a fraction L/(N + 1) of the 
times. A mathematical formalization of this property 
entails non-trivial considerations and it will not be pur- 
sued here jl^ ■ Rather we shall later introduce explicitly 
the bias of strategies as an external parameter and study 
the collective behavior as a function of it. 

Strategies derived from a random predictor, are un- 
biased but fail to have a minimal degree of "continu- 
ity". Loosely speaking, continuity of A means that if 
the change in the information X is small, the change in 
the prediction A(T) 1 or at least in the prescribed action 
0[L — .4(2")] should also be small (or rare). For exam- 
ple, the action prescribed by the strategy should change 
only rarely when the attendance of a past day changes by 
a small amount. Considering that past attendance may 
also be subject to observation errors, continuity is a quite 
desirable robustness property of strategies. 
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A workable model 

In the following we focus on a particular ensemble of 
strategies, which is obtained by reducing the information 
space. The intuition is that what is really telling about 
the value of a past attendance A(t — k) is whether that 
was below or above the comfort level L, which is a binary 
information. In other words we consider strategies 

a^ = e[L-A s Al)} (2) 

which depend only on the information 

At(i) = {9[L - A(t - 1)], . . . , Q[L - A(t - to)]}. (3) 

Clearly strategies derived in this way have a high degree 
of continuity. 

Not all strategies derived by a predictor a T si = Q[L — 
A S} i(T)} are of the form described above. There are only 
2 2 strategies of this type, which means that a reduction 
of the information space also implies a strong reduction 
of the strategy space |2lj ]. 

The fact that agents use a strategy space whose size is 
independent of N makes much sense. In context where 
agents interact with a crowd, their behavior is insensitive 
to the exact size N of the population. 

Henceforth we assume that each agent is assigned S 
strategies randomly drawn from this pool. More pre- 
cisely, for each i, s and /i we draw a^ 1 i independently from 
the distribution 

P(a) = Prob{a^ = a} = aS(a - 1) + (1 - a)S(a). 

As explained above, the induced strategy space of the 
El Farol problem is such that on average, agents attend 
the bar with a frequency L/N. In other words, binary 
strategies of the El Farol bar problem account a priori 
for the convergence of the attendance to L on average. 
Hence we shall consider below an ensemble of strategies 
such that the average of i is a « L/N. Actually we 
shall see that small deviations of a from L /N may have 
a large effect in the limit N — > oo. 

A further simplification, which does not change the 
qualitative nature of the results 01 amounts to consider 
a linear dynamics of the strategy scores 

u i)S (t + 1) = M*) - ( 2 <i* } - iM*) - L l ( 4 ) 

Note that the strategies that predict the correct choice 
are rewarded whereas those prescribing a wrong choice 
are punished. 

The understanding of the behavior of this model is 
made complex by the feedback of the fluctuations of A(t) 
with the dynamics of information (i{t). Notice that Eq. 
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FIG. 1: Behavior of the average attendance (top) and of the 
fluctuations (bottom) in the El Farol bar problem with L = 60 
seats, a — 1/2 and m = 2, 3 and 6 from left to right. In 
both cases, the solid (dotted) line refers to true (random) 
information. 

(J3J) is equivalent to assuming that /i(t) follows the non- 
linear dynamics 

M(t+l) = |2A»(t) + e[i-A(t)]| am (5) 

where | • ■ ■ |p is the modulus P operation. The behavior 
of the Minority Game is largely unaffected if this dy- 
namics is replaced by a random draw of (j,(t) 0,0- As 
we shall see this is not the case in our case. Still it is 
very helpful to introduce a variation of the model with 
random information, where /i(t) is just randomly drawn, 
with uniform probability, from the integers 1, . . . , P = 2™ 
(if the history is random, P can be any integer number). 
This is because the model with random information can 
be understood in detail within a theory such as that de- 
veloped for the MG. This is a quite useful intermediate 
step toward understanding the behavior of the El Farol 
bar problem with true information. In addition it is also 
possible to quantify the effects of the dynamics of true 
information Eq. J5J along the lines of Ref. ^3 ■ 

In order to illustrate the behavior of the model, Fig. 
n shows the results of simulations with L = 60, a = 1/2 
and to = 2, 3, 6 (from left to right) fixed as a function of 
N. This shows what happens in a system where the "en- 
vironment" (L) and the adaptive capabilities of agents 
(a and m) are kept fixed, while the number N of agents 
increases. The top graph shows the deviation (A) — L of 
the average attendance from the comfort level. As we see 
when Na w L the attendance converges to the comfort 
level (A) ps L. However, for small to there is a whole 
interval around Na = L where agents are still able to 
coordinate efficiently (A) w L. For m = 2, the results 
are qualitatively the same with true and random infor- 
mation. For larger values of to the region where (A) ~ L 
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shrinks. In addition, while (A) maintains a monotonic 
behavior with random information, it develops a maxi- 
mum and minimum for intermediate values of aN/L be- 
yond which the behavior with true information markedly 
departs from that with random information. 

It is important to quantify the model's behavior also 
beyond the properties of (A) . Indeed, we have seen that 
convergence to the comfort level is a trivial result in the 
limit N — > oo. It is a built-in property of the model 
which arises from the requirement of unbiased strategies 
(o s» L/N). 

The non-trivial cooperative behavior of this system, 
as forcefully remarked by the literature on the Minority 
Game, lies in how and whether agents manage to decrease 
fluctuations of the attendance A(t) around the comfort 
level L. Indeed even if A(t) equals L on average, the dis- 
tance |A(t)— L\ measures the amount of wasted resources, 
either unexploited Ait) < L or over-exploited A(t) > L. 
Therefore, the quality of the cooperation among agents 
is measured, at a finer level, by the fluctuations around 
the resource level, defined as 



0.5 



((A-L) 2 ) 



(6) 



where (. . .) is the average on the stationary state. Given 
that A(t) is the sum of N contributions, it is natural to 
study the quantity a 2 /N which, as we shall see has a 
finite limit value in the limit N — > oo. 

The behavior of a 2 /N is shown in Fig. ^ Away from 
the point aN = L, the increase of a 2 /N is mainly due to 
the deviation (A) — L and as before, it differs in the cases 
of true and random information. For small m, a 2 /N dis- 
plays a maximum at aN = L which becomes shallower as 
m increases and it disappears for m = 6. This non-trivial 
behavior suggests that a small bias in the strategies, of 
either sign, is beneficial as it decreases the fluctuations 
(see also gf). 

This is evident from Fig. [3 where we compute the 
fraction (\A — L\)/N of unsatisfied agents and compare 
it with the behavior of agents who attend the bar at ran- 
dom, either with probability a or with probability L/N. 
Here we clearly see that in the region aN « L adaptive 
agents behave less efficiently than random agents. This 
effect is related to the emergence of fluctuations and is 
stronger for small values of m. Efficiency increases for 
larger values of m. 

We shall devote the rest of this paper to explain 
the non-trivial behavior displayed by (A) and a 2 . The 
first step will be extending the analytic approach of 
Refs. |2l lUj to the model with random information, 
which is essentially equivalent to a Minority Games with 
biased strategies and tunable resource level. Then we 
shall analyze the case with true information. 
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FIG. 2: Fraction of losers in the El Farol bar problem with 
L — 60 seats, a = 1/2 and m = 2 for random (o) and true (•) 
information. The results for a population of random agents 
who attend the bar independently with probability a and L/N 
is also shown for comparison (dashed and dotted lines) . 



STATISTICAL MECHANICS OF THE EL FAROL 
BAR PROBLEM WITH RANDOM 
INFORMATION 

Following Ref one deduces that agent i ends up 
playing strategy s with frequency /,. s that minimizes the 
quantity 

p 

H = Y J P l/ ({A\v)-L) 2 
where p v = Prob{/i(i) = v) = 1/P and 

N S 

^ = E E /-<<- 

i=l Si =l 

is the average of A(t) conditional on the event /z(t) = v. 
This result can be obtained in a straightforward manner 
by taking the average of Eq. Q in the stationary state, 
and comparing the resulting equations with the first or- 
der conditions of the minimization of H with respect to 

fs,i • 

The function H measures the predictability in the sys- 
tem, i.e. the amount of useful information about the 
fluctuations of the attendance which is left in the signal 
/i(i). Indeed if e.g. {A\u) ^ L, the signal n(t) carries in- 
formation which is useful to predict whether one should 
attend or not to the bar when fi(t) = v. The fact that the 
stationary state corresponds to minimal H means that 
agents exploit to their best the system's predictability. 

In terms of statistical mechanics, H can be considered 
as a Hamiltonian and its minima can be studied with 
standard methods. As long as H > 0, the stationary 
state is unique, and the replica trick [14( gives exact re- 
sults 



is unique 



For the sake of simplicity, we will focus 
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on the S — 2 case (see Ref. [l5| for a generalization to 
S > 2). The details of the calculus are of no special 
interest, as they mostly replicate previously published 
calculations 0- 

We shall consider the thermodynamic limit N — > oo 
with 



fixed 0| . Furthermore, in order to study the effect of 
a deviation of a around t we introduce the convenient 
parameter 7 with the equation: 



£{!-£) 



-7 



(7) 



and we shall consider 7 finite in the limit N — > 00 . As we 
shall see this is a non-trivial limit. For example, the case 
where a — £ w 0(1) is finite, is trivial because then each 
agent uses just one strategy, that which prescribes to go 
more (less) often if a < I (a > £) and there is no dynamics 
at all. Eq. J2J implies that we consider small deviation of 
a = £+0(l/VN) from the correct value t = L/N. Even 
such a small deviation, which vanishes as N — ► 00, has 
a finite effect on the global behavior as we shall see. At 
any rate, other limits can be obtained analyzing the cases 
where either t, a or 7 vanish or diverge, in the analysis 
that follows. 

After some routine calculations, we find that the pre- 
dictability H/N is given by 



h .y V^M) l + Q(C) + 2 7 2 /a rsl 

and the fluctuations, as long as the stationary state is 
unique (see later), are equal to 



a 2 = H + N ^ £{1 2 e) [1 - Q(C)] + NZ. (9) 
In these two equations, Q and \ are given by 



x(C) 



erf(C/v / 2) 



f 



(10) 



whereas the parameter £ is uniquely determined by the 
transcendental equation 



£ -Q(C)-l-2^ = 



(11) 



as a function of a and 7. Finally £ in Eq. is a 
term which arises from collective fluctuations and its cal- 
culation requires, in principle, a detailed theory of the 




FIG. 3: Phase diagram of the El Farol bar problem. The 
dashed lines correspond to the trajectories of systems with 
L = 60, a = 1/2 and m = 2, ... ,6 as the number of agents 
increases (from bottom to top). The dot-dashed line corre- 
sponds to a typical trajectory of a system with fixed L, N 
and a > L/N as the agents' memory changes. 



stochastic dynamics of the model (see Ref. |llj) which 
we shall not pursue here, but whose importance is dis- 
cussed below. 

When 7 = 0, these equations are identical to those 
which describe the MG behavior. We briefly recall the 
resulting picture: When a is large, the system is in an in- 
formation rich phase with positive predictability H > 0. 
The predictability H decreases as a decreases. This can 
be understood by observing that, at fixed P = 2 M , a 
decrease in a means that the number of agents increases, 
and hence their ability to exploit the information. At 
a critical value a c = 0.3374 . . . the predictability van- 
ishes and a phase transition to a symmetric phase with 
H = takes place. The fact that H measures an asym- 
metry in the signal means that the phase transition is 
related to symmetry breaking |l2j . The phase transition 
is signaled by the divergence of the spin susceptibility Xj 
which is infinite in the whole symmetric phase. The sta- 
tionary state is unique, independent of initial conditions, 
for a > a c and these facts conspire 

.11] 

in such a way 

that X = for a > a c . On the contrary, for a < a c , the 
stationary state is not unique but it depends on the initial 
conditions. Then S > can be computed within a very 
accurate self-consistent approximation |ll| . In particular 
this shows that E oc 1/a for small a. 

Therefore, the El Farol bar problem, with the strategy 
ensemble studied here, and the MG have the same behav- 
ior of fluctuations when 7 = 0, that is, when Na = L. 
When 7^0 the picture changes in the following way: 
First we observe that all quantities depend on 7 2 , then 
it is enough to consider only the case 7 > since all 
conclusions extend directly to the case 7 < 0. When 
7 is small, we still have a phase transition at the point 
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a c (j) = erf(Cc/V2) where x 
the solution of 



=^-«o-i 



oo. The parameter £ c is 



2 7 2 



( 2 



erf(C/V2) 



(12) 



Figure |3 plots the phase diagram of the game. The criti- 
cal line separates the asymmetric phase (H > 0) from 
the symmetric phase (H = 0). It crosses 7 = at 
a c (0) = 0.3374 . . ., the critical point of the standard 
MG 0, U when 7 increases a c decreases and a c = 
f or 7 > 1 / y/n. 

The meaning of the phase diagram is clear: Indeed 
H = implies (A) = L. The symmetric phase is the 
region of parameters where the average attendance con- 
verges to the comfort level. This region is also character- 
ized by large collective fluctuations a 2 and by a depen- 
dence on initial conditions; in particular, the fluctuations 
decrease if the difference of strategy a priori valuation in- 
creases as discussed in the MG literature 0, 0, 0, 0] . 
On the contrary, there is no equality between (A) and L 
in the asymmetric phase if 7 > 0. 

These results explains the complex behavior reported 
111 Fig. □ Indeed as N varies with L, a and m fixed, 
the system follows the trajectories shown in Fig. [21 For 
small values of m these cross the symmetric phase in the 
region aN « L. Fig. 0] indeed shows that H computed 
along these trajectories fully agrees with the theoretical 
results (for random information). The symmetric phase 
is characterized by large fluctuations, mainly due to dy- 
namic fluctuations (the term E). This explains the non- 
monotonic behavior of a 2 in Fig. ^for small values of m. 
When to increases, the trajectory in Fig. moves toward 
larger values of a and, for m > m c , it remains all in the 
asymmetric phase. Then E = 0, which means that a 2 
displays a single minimum at aN = L (i.e. 7 = 0). 

In spite of the fact that the theory is derived in a par- 
ticular N — > 00 limit, our results show that it reproduces 
accurately results for moderately small values of N. At 
the same time, it clearly predicts how the collective be- 
havior depends on the parameters N, L, a and m. Any 
"experiment" where one of these parameters is changed, 
corresponds to a precise trajectory in the (a, 7) phase 
diagram and a corresponding collective behavior. For 
example, for fixed m and i = L/N, anomalous fluctua- 
tions will arise in an interval of size \j\[P = 2~ m l 2 as 
a changes around I along a vertical trajectory in Fig. 
The memory size controls fluctuations. Indeed generally, 
as P increases, keeping all other parameters fixed, the 
systems moves away from the symmetric phase (see dot- 
dashed line in Fig.|3). 

It is precisely at the boundary of the two phases that 
coordination is most efficient. This means that there is 
an intermediate memory length which is optimal for the 
collective behavior. 




aN/L 



FIG. 4: H/N in the El Farol bar problem with true infor- 
mation and L = 60, a = 1/2 and m = 2,3,4 and 6 as a 
function of N. Theoretical results (full lines) fully agree with 
numerical simulations (symbols). 
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FIG. 5: Entropy S m /m of the El Farol bar problem with true 
information and L = 60, a — 1/2 and m = 2, 3, 4 and 6 as a 
function of N. Vertical lines delimit the symmetric phase of 
the corresponding model with random information. 



TRUE INFORMATION 

The behavior of the model with true information devi- 
ates from that with random information because, under 
the dynamics Eq. JSJ, the space of informations is not 
sampled uniformly. More precisely, if is the probabil- 
ity of information fi in the stationary state, we can make 
this statement quantitative introducing the entropy 



2 m 

£ 



p"log 2 p* 



When the space of information is sampled uniformly — 
l/2 m we find S m = m, whereas S m = when only one 
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value of [i is sampled recursively. Fig. [3] shows that 
S m ~ m only occurs in the region where H ~ in the 
corresponding model with random information, i.e. in 
the symmetric phase. This is consistent, because if H = 
then the process of Eq. (JSJ is a simple diffusion on the 
so-called De Bruijn graph. We refer the interested reader 
to Ref. 01 f° r a detailed account of this process. Here it 
is sufficient to observe that if (A\fi) = L for all /i, thenj^l 

Prob{/x(t+l) = |2/i(t) + l| 2m } = i 

If this is the case, the stationary state probability p M = 
l/2 m is uniform [12j. When H > 0, for a particular value 
of /i, we expect that L — A{t) will take more frequently 
one sign or the other. Hence Eq. (JSJ will induce a biased 
diffusion process on n(t). In particular, for aN < L, the 
attendance will be more often below the comfort level 
than otherwise. This means that l's will occur more 
often that 0's in Eq. 10 for n(t). It is easy to check that 
a systematic bias of this type, produces a distribution p M 
which is concentrated on fi = {111 . . .} (using the binary 
representation). Likewise p M is peaked on p = {000 . . .} 
when aN > L 23]. 

Hence outside the symmetric phase, when H > 0, the 
process fi{t) acquires a bias, which reduces the "effective 
number of information patterns" to a number 2 Sm . In 
order to understand how this changes the collective be- 
havior of players, imagine the extreme case S m = where 
for some reason, the state fj,(t) — {000 . . .} = occurs for 
a large number of periods. Then agents will learn how to 
respond optimally to this state fi = 0. 

There are N ++ = a 2 N agents with — a° 2 = 1- 
They will go anyway. There are iV__ = (1 — a) 2 N agents 
with cq 1 — a® 2 = who will not go. The remaining 
2a(l — a)N can decide. Agents can learn to converge to 
A(t) = L provided that N ++ < L < N - N , i.e. if 

L Ar L 
< N < — . 

1 - (1 - a) 2 " " a 2 

Furthermore, if N > L/a, in particular if it is close to 
the upper limit L/a 2 , the information /i = will arise 
very frequently from the dynamics Eq. JSJ. The same 
argument runs for N < L/a and it shows agents can 
coordinate quite efficiently when N is close to L/[l(l — 
a) 2 ], because then the information /j, = {111...} will 
almost always occur. 

This complex interaction between the dynamics of A(t) 
and fi(t) explains the non-monotonic behavior of (A) — L 
in Fig. |T] 

CONCLUSIONS 

We have presented a complete theory of the El Farol 
bar problem. The key issue lies in the definition of the 



strategy space. First we have shown that, for the most 
general ensemble of strategies, convergence of the atten- 
dance to the comfort level is a trivial consequence of the 
law of large numbers. It does not even require inductive 
rationality. Even zero-intelligent agents are able to reach 
it. This is likely to be true for any reasonable predictor 
based strategy, in particular for unbiased ones. 

We further focus attention on a particular ensemble 
of strategies, with a desirable continuity property. This 
leads us to study models very similar to the Minority 
game. We first introduce the random information ver- 
sion of the game, for which statistical physics provides a 
complete theoretical understanding and analyze the con- 
sequences of the dynamics of true information. 

It turns out that as the parameter of the El Farol bar 
problem change, the system performs a trajectory on a 
phase diagram characterized by a symmetric phase where 
(A) = L and a phase where (A) ^ L. Deep in the sym- 
metric phase, anomalous fluctuations similar to crowd 
effects, develop making the coordination of agents even 
worse than that of random agents in some cases. It is 
precisely close to the phase boundary that agents manage 
to coordinate most efficiently. This, however, requires a 
small bias of either sign, in the strategies of agents. 

These findings not only confirm that the El Farol bar 
problem is indeed a quite interesting complex system. 
But they also show that a coherent understanding of its 
behavior is possible, using tools of statistical physics. 
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